Abstract

In this article we provide some nonnegative and positive estimators of the mean
squared errors (MSEs) for shrinkage estimators of multivariate normal means. The
proposed estimators are shown to improve on the uniformly minimum variance unbiased
estimator (UMVUE) under a quadratic loss criterion. A similar improvement is also
obtained for the estimators of the MSE matrices for shrinkage estimators. We also
apply the proposed estimators of the MSE matrix to form confidence sets centered
at shrinkage estimators and show their usefulness through numerical experiments.
1 Introduction

Since Stein [15] and James and Stein [9] proved the inadmissibility of the sample mean
as an estimator of a multivariate normal mean and demonstrated the superiority of the
James-Stein estimator, considerable effort has been devoted to developing the theory of
shrinkage estimation in the theoretical literature. In step with this development, shrinkage
estimators have been applied to some practical problems. For example, see Fay and
Herriot [7] and Battese, Harter and Fuller [3] for small-area estimation and Adkins and
Eells [1] for the estimation of energy demand. As pointed out in Wan et al. [17], however,
shrinkage estimation has not necessarily found many applications in empirical work. This
is considered to be mainly because relatively little attention has been paid to evaluating
the precision of shrinkage estimators. This issue is important not only for recognizing how
much it pays to use shrinkage estimators for a given problem but also for constructing
confidence sets.

Joshi [10] proved the existence of confidence sets for a multivariate normal mean centered
at shrinkage estimators which have higher coverage probabilities than the conventional
one, a sphere centered at the observations, when the variance is known. Hwang and
Casella [8] gave an explicit proof that when the variance is known, a confidence set centered
at the positive-part Stein estimator has uniformly higher coverage probability than the
conventional one.

Carter et al. [4] discussed this problem with unknown variance in the context of linear
regression models and proposed to use estimators of the MSE and the MSE matrix of
the James-Stein estimator of regression coefficients to evaluate its precision. They
derived the UMVUEs of the MSE and the MSE matrix for the James-Stein estimator
and proposed to use the UMVUE of the MSE matrix to form a confidence set. They also
showed that the proposed confidence set has an asymptotically smaller volume than that
of the conventional one based on F statistics. Wan et al. [17] generalized their argument to
a wider class of shrinkage estimators. As clarified in Section 2, however, the UMVUEs of
the MSE and the MSE matrix of a shrinkage estimator have the serious drawback that they
are not always positive or positive definite. Hence, in a practical sense, the results in
Carter et al. [4] and Wan et al. [17] may still not be suitable for empirical work.

Kubokawa and Srivastava [13] focused their attention on the drawback of the UMVUEs
and considered the estimation of the MSEs and the MSE matrices from a decision
theoretical viewpoint. They defined quadratic loss functions for evaluating estimators and
proposed positive estimators of the MSE and the MSE matrix of the James-Stein
estimator improving on the UMVUE under the loss functions.

In this article we take a similar approach to the one in Kubokawa and Srivastava[13] 
and extend their argument to the MSEs and the MSE matrices of a wider class of shrinkage 
estimators. We propose nonnegative and positive estimators of the MSEs and the MSE 
matrices of shrinkage estimators improving on the UMVUEs. We also apply the proposed 
estimators of the MSE matrices to form confidence sets centered at shrinkage estimators 
and show that they have higher coverage probabilities than the conventional one through 
Monte Carlo studies. 

The organization of this article is as follows. In Section 2 we summarize some pre- 
liminary facts on shrinkage estimators and their MSEs and MSE matrices. Section 3 
and Section 4 provide some nonnegative and positive estimators of the MSEs and the 
MSE matrices of shrinkage estimators improving on the UMVUEs. In Section 5.1 we 
give some Monte Carlo studies to confirm the theoretical results on the dominance rela- 
tionship between the proposed estimators and the UMVUEs. In Section 5.2, we apply 
the proposed estimators of MSE matrices to form confidence sets centered at shrinkage 
estimators and show their usefulness through Monte Carlo studies. We end this paper 
with some concluding remarks in Section 6. 
2 Preliminary facts on the estimation of MSE and MSE matrix for shrinkage estimators

Let $X$ and $S$ be random variables which are independently distributed as

$$X \sim N_p(\theta, \sigma^2 I_p), \qquad S \sim \sigma^2 \chi^2_n, \tag{1}$$

where $N_p(\theta, \sigma^2 I_p)$ denotes the $p$-variate normal distribution with mean $\theta$ and covariance
matrix $\sigma^2 I_p$, and $\chi^2_n$ denotes a chi-square variable with $n$ degrees of freedom. Suppose
that both $\theta$ and $\sigma^2$ are unknown and that $p \ge 3$. We write $W = \|X\|^2/S$. In this article
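we consider the following class of shrinkage estimators

$$\delta_\phi = \left(1 - \frac{\phi(W)}{W}\right) X \tag{2}$$

improving on $X$ under squared error loss. When $\phi(W) = (p-2)/(n+2)$, $\delta_\phi$ is the
James-Stein estimator (James and Stein [9]),

$$\delta_{JS} = \left(1 - \frac{p-2}{n+2}\cdot\frac{1}{W}\right) X.$$

When $\phi(W) = \min(W, (p-2)/(n+2))$, $\delta_\phi$ is the positive-part Stein estimator (Baranchik [2]),

$$\delta_{JS}^{+} = \max\left(0,\; 1 - \frac{p-2}{n+2}\cdot\frac{1}{W}\right) X.$$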
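The two estimators above can be computed directly from an observation. The following is a minimal sketch, assuming the form $(1 - \phi(W)/W)X$ with $W = \|X\|^2/S$; the function name `shrinkage_estimators` and the sample inputs are illustrative and not part of the original article.

```python
def shrinkage_estimators(x, s, n):
    """Return (delta_JS, delta_JS_plus) for an observation x (length p >= 3)
    and a statistic s > 0 distributed as sigma^2 * chi-square with n d.f."""
    p = len(x)
    w = sum(v * v for v in x) / s        # W = ||X||^2 / S
    c = (p - 2) / (n + 2)                # phi(W) for the James-Stein estimator
    js_factor = 1.0 - c / w              # can be negative when W is small
    pp_factor = max(0.0, js_factor)      # positive-part truncation
    return [js_factor * v for v in x], [pp_factor * v for v in x]


# Small W: the James-Stein factor is negative, the positive-part estimator is 0.
js_small, pp_small = shrinkage_estimators([0.3, -0.2, 0.1, 0.4, -0.1], s=5.0, n=5)
# Large W: both estimators coincide.
js_big, pp_big = shrinkage_estimators([3.0] * 5, s=5.0, n=5)
```

The first call illustrates why the positive-part version is preferred in practice: for small $W$ the untruncated factor reverses the sign of every coordinate.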



We assume that $\phi(\cdot)$ and $\phi'(\cdot)$ are finite. Most shrinkage estimators in the class (2) which
have been proposed in the literature satisfy this condition. The MSE matrix of $\delta_\phi$ is defined
by $M(\delta_\phi) = E[(\delta_\phi - \theta)(\delta_\phi - \theta)']$ and the MSE is its trace, $R(\delta_\phi) = \mathrm{tr}\, M(\delta_\phi)$. We note
that since $\delta_\phi$ improves on $X$, $R(\delta_\phi) \le p\sigma^2$. In this article we consider the estimation of
$R(\delta_\phi)$ and $M(\delta_\phi)$. As Carter et al. [4] and Wan et al. [17] suggested, the estimated MSE
and MSE matrix are useful as precision measures for $\delta_\phi$. Denote by $\hat M_0(\delta_\phi)$ the UMVUE
of $M(\delta_\phi)$. They proposed to use $\hat M_0(\delta_\phi)$ in a quadratic form which defines a confidence
set for $\theta$,

$$Q(\delta_\phi) = (\delta_\phi - \theta)'\, \hat M_0^{-1}(\delta_\phi)\, (\delta_\phi - \theta). \tag{3}$$

Let $\Gamma$ be the orthogonal matrix such that $\Gamma X = (\|X\|, 0, \ldots, 0)'$ and let $E_{11}$ be the
$p \times p$ matrix which has one for the (1,1) element and zero elsewhere. Wan et al. [17] showed
that when both $\phi(\cdot)$ and $\phi'(\cdot)$ are continuous, $\hat M_0(\delta_\phi)$ is expressed by

Mh) = --I P -S- 9l (W)-I p + S-g 2 (W)- — — + S ■ ^ 



n v j v **\ > || X || 2 w || X | 



$$= S \cdot \Gamma' \left\{ \left(\frac{1}{n} - g_1(W)\right) I_p + g_3(W)\, E_{11} \right\} \Gamma, \tag{4}$$
where



9i(W) = 



i = l,2, ^W=y 2 W + 



<j>\W) 
w ' 



fci(wO 



w ' 



The UMVUE of R(8+) is obtained by 



RoiSJ = trMo(^) = — - 5 • <?(W0 + 



(5) 



where 



/i(W) = (p - 2 



VP 



+ 20'(W). 



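In the case of $\delta_{JS}$, Efron and Morris [6] and Carter et al. [4] showed that $\hat R_0(\delta_{JS})$ and
$\hat M_0(\delta_{JS})$ are expressed by

$$\hat R_0(\delta_{JS}) = \frac{pS}{n} - \left(\frac{p-2}{n+2}\right)^2 \frac{S}{W}, \tag{6}$$

$$\hat M_0(\delta_{JS}) = S \cdot \Gamma' \left\{ \frac{1}{n} I_p - \frac{p-2}{(n+2)^2}\cdot\frac{1}{W}\,\bigl(2 I_p - (p+2) E_{11}\bigr) \right\} \Gamma, \tag{7}$$

respectively. We can see that (6) and (7) coincide with (5) and (4) with $\phi(W) = (p-2)/(n+2)$.
$g_1(\cdot)$ and $g_3(\cdot)$ of $\hat M_0(\delta_{JS})$ are written as

$$g_1(W) = \frac{2(p-2)}{(n+2)^2}\cdot\frac{1}{W}, \qquad g_3(W) = \frac{(p-2)(p+2)}{(n+2)^2}\cdot\frac{1}{W}. \tag{8}$$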
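To make the sign problem of these UMVUEs concrete, the sketch below evaluates (6) and the eigenvalues of (7). It relies on the observation that $\Gamma'\{\cdot\}\Gamma$ is orthogonally similar to a diagonal matrix, so the eigenvalues can be read off without any matrix routine; the function name `umvue_js` and the numerical inputs are illustrative assumptions.

```python
def umvue_js(w, s, p, n):
    """UMVUE of the MSE, eq. (6), and the two distinct eigenvalues of the
    UMVUE of the MSE matrix, eq. (7), for the James-Stein estimator."""
    k = (p - 2) / ((n + 2) ** 2 * w)
    r0 = p * s / n - ((p - 2) / (n + 2)) ** 2 * s / w
    # Diagonal entries of S * {(1/n) I_p - k (2 I_p - (p + 2) E_11)}:
    eig_x = s * (1 / n - 2 * k + (p + 2) * k)   # (1,1) entry, direction of X
    eig_perp = s * (1 / n - 2 * k)              # remaining p - 1 entries
    return r0, eig_x, eig_perp


r0_small, ex_small, ep_small = umvue_js(w=0.05, s=1.0, p=5, n=5)
r0_large, ex_large, ep_large = umvue_js(w=10.0, s=1.0, p=5, n=5)
```

For the small value of $W$ both the estimated MSE and $p-1$ of the eigenvalues are negative, while for the large value everything is positive; the trace of (7) reproduces (6) in both cases.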



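In the case of $\delta_{JS}^{+}$, $\phi'(\cdot)$ is not continuous. In order to apply the argument in Wan et
al. [17] to the derivation of $\hat R_0(\delta_{JS}^{+})$ and $\hat M_0(\delta_{JS}^{+})$, some modifications are required.
$\hat R_0(\delta_{JS}^{+})$ and $\hat M_0(\delta_{JS}^{+})$ are expressed as follows,

$$\hat R_0(\delta_{JS}^{+}) = \begin{cases} -\dfrac{pS}{n} + SW + C_0\, S\, W^{n/2}, & W < \dfrac{p-2}{n+2}, \\[2mm] \hat R_0(\delta_{JS}), & \text{otherwise}, \end{cases} \tag{9}$$

$$\hat M_0(\delta_{JS}^{+}) = \begin{cases} S \cdot \Gamma' \left\{ -\left(\dfrac{1}{n} - C_1 W^{n/2}\right) I_p + \left(W + C_2 W^{n/2}\right) E_{11} \right\} \Gamma, & W < \dfrac{p-2}{n+2}, \\[2mm] \hat M_0(\delta_{JS}), & \text{otherwise}, \end{cases} \tag{10}$$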

where 

Co = 2 



/ p 


v- 


4V ( 


V n 


n + 





-n/2 



n n + 2) \n + 2j 7 2 n + 2 \n + 2 

The proofs of (9) and (10) are given in the Appendix. $g_1(W)$ and $g_3(W)$ of $\hat M_0(\delta_{JS}^{+})$ for
$W < (p-2)/(n+2)$ are written as

$$g_1(W) = \frac{2}{n} - C_1 W^{n/2}, \qquad g_3(W) = W + C_2 W^{n/2}. \tag{11}$$

When $W \ge (p-2)/(n+2)$, $g_1(W)$ and $g_3(W)$ of $\hat M_0(\delta_{JS}^{+})$ are identical to (8).

It is easy to see from (6) and (9) that both $\hat R_0(\delta_{JS})$ and $\hat R_0(\delta_{JS}^{+})$ have the undesirable
property of taking negative values for small $W$. When we use an estimated MSE matrix
to form a confidence set as in (3), it should be positive definite. However, we can also see
from (7) and (10) that both $\hat M_0(\delta_{JS})$ and $\hat M_0(\delta_{JS}^{+})$ are not positive definite for small $W$.
In this article we consider the estimation of $R(\delta_\phi)$ and $M(\delta_\phi)$ from a decision theoretical
viewpoint. Kubokawa and Srivastava [13] considered the estimation of the risk reduction
and the risk reduction matrix of $\delta_{JS}$,

$$R^*(\delta_{JS}) = p\sigma^2 - R(\delta_{JS}), \qquad M^*(\delta_{JS}) = \sigma^2 I_p - M(\delta_{JS}).$$

They provided a class of estimators $\hat R^*(\delta_{JS})$ and $\hat M^*(\delta_{JS})$ improving on the UMVUEs of
$R^*(\delta_{JS})$ and $M^*(\delta_{JS})$,

$$\hat R_0^*(\delta_{JS}) = \frac{pS}{n} - \hat R_0(\delta_{JS}), \qquad \hat M_0^*(\delta_{JS}) = \frac{S}{n} I_p - \hat M_0(\delta_{JS}),$$

under quadratic losses and satisfying $\hat R(\delta_{JS}) = (pS/n) - \hat R^*(\delta_{JS}) > 0$ and $\hat M(\delta_{JS}) =
(S/n) I_p - \hat M^*(\delta_{JS}) > 0$. In the following sections we extend the argument in Kubokawa
and Srivastava [13] to a wider class of shrinkage estimators $\delta_\phi$ and propose some classes
of nonnegative and positive estimators of $R(\delta_\phi)$ and nonnegative definite and positive
definite estimators of $M(\delta_\phi)$ improving on the UMVUEs under quadratic loss functions.

Remark 1. Carter et al. [4] and Wan et al. [17] considered this problem in the context of
linear regression models,

$$Y = A\beta + \epsilon,$$

where $Y$ is the vector of $N$ observations, $A$ is the $N \times p$ design matrix and $\epsilon \sim N_N(0, \sigma^2 I_N)$ is
the disturbance. As mentioned in Kubokawa and Srivastava [13], the setting (1) considered
here can be interpreted as a canonical form of linear regression models. Let $B = (A'A)^{1/2}$
denote a $p \times p$ matrix such that $BB' = A'A$. Then $X$, $\theta$, $S$ and $n$ correspond to
$B^{-1}A'Y$, $B\beta$, $Y'(I_N - A(A'A)^{-1}A')Y$ and $N - p$, respectively.
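The reduction in Remark 1 can be sketched numerically as follows. This is an illustration only: it assumes NumPy is available, takes $B$ to be the symmetric square root of $A'A$ (one choice satisfying $BB' = A'A$), and uses a simulated design matrix and coefficient vector that are not from the article.

```python
import numpy as np


def canonical_form(y, a):
    """Map regression data (y, a) to the canonical statistics (X, S, n)."""
    n_obs, p = a.shape
    gram = a.T @ a
    evals, evecs = np.linalg.eigh(gram)
    b = evecs @ np.diag(np.sqrt(evals)) @ evecs.T    # symmetric B with B B' = A'A
    x = np.linalg.solve(b, a.T @ y)                  # X = B^{-1} A'Y
    resid = y - a @ np.linalg.solve(gram, a.T @ y)   # (I - A (A'A)^{-1} A') Y
    s = float(resid @ resid)                         # S = residual sum of squares
    return x, s, n_obs - p


rng = np.random.default_rng(0)
a = rng.normal(size=(12, 3))                         # hypothetical design matrix
beta = np.array([1.0, -2.0, 0.5])                    # hypothetical coefficients
y = a @ beta + rng.normal(size=12)
x, s, n = canonical_form(y, a)
```

A quick sanity check on the decomposition is that $\|X\|^2 + S = \|Y\|^2$, since $\|X\|^2$ is the fitted sum of squares and $S$ the residual sum of squares.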
3 Improved estimators of the MSE

3.1 An improved nonnegative estimator of the MSE

In this section we provide a nonnegative estimator of $R(\delta_\phi)$ improving on $\hat R_0(\delta_\phi)$. For
evaluating an estimator $\hat R(\delta_\phi)$, we use the following quadratic loss function,

$$L(\hat R(\delta_\phi); R(\delta_\phi)) = (\hat R(\delta_\phi) - R(\delta_\phi))^2. \tag{12}$$

Under the loss (12), the truncated estimator $\hat R^{TR}(\delta_\phi) = \max(0, \hat R_0(\delta_\phi))$ obviously
dominates $\hat R_0(\delta_\phi)$. The estimator proposed in this section is shown to dominate $\hat R^{TR}(\delta_\phi)$.
Let $a(W)$ be defined by

$$a(W) = \frac{n}{p}\,\{p\, g_1(W) - g_3(W)\}.$$

Then, taking the trace of (4), $\hat R_0(\delta_\phi)$ is rewritten as

$$\hat R_0(\delta_\phi) = pS\,(1 - a(W))/n. \tag{13}$$

We introduce the following class of estimators with a function $\psi(\cdot)$,

$$\hat R(\psi; \delta_\phi) = \frac{pS}{n}\,(1 - a(W)\psi(W)), \qquad \psi(W) \le \frac{1}{a(W)}. \tag{14}$$

Then we obtain the following theorem.
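Theorem 1. Let $\psi_0(W)$ be defined by

$$\psi_0(W) = \max\left\{ \min\left(1, \frac{1}{a(W)}\right),\; \frac{1}{a(W)}\left(1 - \frac{n(1+W)}{n+p+2}\right) \right\}.$$

Then

$$\hat R(\psi_0; \delta_\phi) = \frac{pS}{n}\,(1 - a(W)\psi_0(W)) = \min\left\{ \hat R^{TR}(\delta_\phi),\; \frac{pS(1+W)}{n+p+2} \right\}$$

improves on $\hat R_0(\delta_\phi)$ under the loss function (12).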

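Proof. In the proof of this theorem, we use a procedure similar to the one which Stein [16] used to
derive the improved truncated estimator of $\sigma^2$. From the definition of $\hat R(\psi; \delta_\phi)$, we have

$$E[(\hat R(\psi; \delta_\phi) - R(\delta_\phi))^2 \mid W] = \left(\frac{p\,a(W)}{n}\right)^2 E[S^2 \mid W]\,\psi^2(W) - \frac{2p\,a(W)}{n}\, E\!\left[ S\left(\frac{pS}{n} - R(\delta_\phi)\right) \Bigm| W \right] \psi(W) + E\!\left[ \left(\frac{pS}{n} - R(\delta_\phi)\right)^2 \Bigm| W \right]. \tag{15}$$

Let $\psi^*(W)$ be the function which minimizes $E[(\hat R(\psi; \delta_\phi) - R(\delta_\phi))^2 \mid W]$. From (15),
$\psi^*(W)$ is expressed by

$$\psi^*(W) = \frac{1}{a(W)} \left\{ 1 - \frac{n\, E[S/\sigma^2 \mid W]}{E[(S/\sigma^2)^2 \mid W]} \cdot \frac{1}{p} \cdot \frac{R(\delta_\phi)}{\sigma^2} \right\}. \tag{16}$$

Denote by $f_k(\cdot)$ and $f_k(\cdot; \lambda)$ the density functions of the central and the noncentral $\chi^2$
distributions with $k$ degrees of freedom and the noncentrality parameter $\lambda = \|\theta\|^2/\sigma^2$.
Then we have

$$\frac{E[S/\sigma^2 \mid W]}{E[(S/\sigma^2)^2 \mid W]} = \frac{\int u^2 f_p(uW; \lambda) f_n(u)\,du}{\int u^3 f_p(uW; \lambda) f_n(u)\,du} \le \frac{\int u^2 f_p(uW) f_n(u)\,du}{\int u^3 f_p(uW) f_n(u)\,du} = \frac{1+W}{n+p+2} \tag{17}$$

from the monotone nondecreasingness of $f_p(uW; \lambda)/f_p(uW)$ in $u$. Hence
from (16), (17) and the fact that $0 \le R(\delta_\phi) \le p\sigma^2$, $\psi^*(W)$ satisfies

$$\frac{1}{a(W)}\left(1 - \frac{n(1+W)}{n+p+2}\right) \le \psi^*(W) \le \frac{1}{a(W)}$$

for any $\lambda$ and any $W$ such that $a(W) > 0$. Denote $\mathcal{R}_+ = \{W \mid W > 0\}$ and define $A$, $B$
and $C$ as follows,

$$A = \left\{ W \Bigm| \frac{1}{a(W)} < 1,\; a(W) > 0,\; W > 0 \right\},$$

$$B = \left\{ W \Bigm| \frac{1}{a(W)}\left(1 - \frac{n(1+W)}{n+p+2}\right) > 1,\; a(W) > 0,\; W > 0 \right\},$$

$$C = \mathcal{R}_+ \setminus (A \cup B).$$

We note that $E[(\hat R(\psi; \delta_\phi) - R(\delta_\phi))^2 \mid W]$ is convex in $\psi(W)$. Therefore, if we set

$$\psi(W) = \min\left(1, \frac{1}{a(W)}\right) \quad \text{for } W \in A,$$

$\psi^*(W)$ satisfies $\psi^*(W) \le \psi(W) < 1$ for $W \in A$, which implies

$$E[(\hat R(\psi; \delta_\phi) - R(\delta_\phi))^2 \mid W \in A] \le E[(\hat R_0(\delta_\phi) - R(\delta_\phi))^2 \mid W \in A]. \tag{18}$$

Similarly, if we set

$$\psi(W) = \max\left\{ 1,\; \frac{1}{a(W)}\left(1 - \frac{n(1+W)}{n+p+2}\right) \right\} \quad \text{for } W \in B,$$

$\psi^*(W)$ satisfies $\psi^*(W) \ge \psi(W) > 1$ for $W \in B$ and then

$$E[(\hat R(\psi; \delta_\phi) - R(\delta_\phi))^2 \mid W \in B] \le E[(\hat R_0(\delta_\phi) - R(\delta_\phi))^2 \mid W \in B]. \tag{19}$$

For $W \in C$, if we set $\psi(W) = 1$,

$$E[(\hat R(\psi; \delta_\phi) - R(\delta_\phi))^2 \mid W \in C] = E[(\hat R_0(\delta_\phi) - R(\delta_\phi))^2 \mid W \in C]. \tag{20}$$

By combining (18), (19) and (20), we can complete the proof. □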

We have shown that $\hat R(\psi_0; \delta_\phi)$ dominates $\hat R_0(\delta_\phi)$ under the loss (12). We note that
the unbiasedness of $\hat R_0(\delta_\phi)$ was not used in the proof. By following the proof, we can see
that any estimator in the class (14) which does not satisfy

$$\frac{1}{a(W)}\left(1 - \frac{n(1+W)}{n+p+2}\right) \le \psi(W) \le \frac{1}{a(W)} \tag{21}$$

is improved by the one which is truncated to satisfy (21). In other words, (21) is a
necessary condition for $\hat R(\psi; \delta_\phi)$ to be admissible in the class (14).

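In the case of $\delta_{JS}$, $a(W) = n(p-2)^2/(p(n+2)^2 W)$ from (6). Then $B$ for $\delta_{JS}$ is written
as

$$B = \left\{ W \Bigm| \frac{p(n+2)^2 W}{n(p-2)^2}\left(1 - \frac{n(1+W)}{n+p+2}\right) > 1,\; W > 0 \right\}.$$

Hence if $(p, n)$ satisfy

$$\left(\frac{p+2}{n}\right)^2 - \frac{4(n+p+2)}{p}\left(\frac{p-2}{n+2}\right)^2 < 0,$$

$B = \emptyset$ and then $\hat R(\psi_0; \delta_{JS})$ is identical to $\hat R^{TR}(\delta_{JS})$. In general, when $n$ and $p$ are large,
$B = \emptyset$ and hence $\hat R(\psi_0; \delta_\phi)$ is identical to $\hat R^{TR}(\delta_\phi)$. However, if $B \ne \emptyset$, $\hat R(\psi_0; \delta_\phi)$ is shown
to improve on $\hat R^{TR}(\delta_\phi)$.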
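For the James-Stein estimator the three estimators discussed in this section can be evaluated in a few lines. The sketch below uses the expression for $a(W)$ quoted above and the truncation $\psi_0(W) = \max\{\min(1, 1/a(W)),\, (1 - n(1+W)/(n+p+2))/a(W)\}$; the function names and the numerical inputs are illustrative assumptions rather than part of the article.

```python
def mse_estimators_js(w, s, p, n):
    """Return (UMVUE, truncated UMVUE, psi_0-estimator) of the MSE of the
    James-Stein estimator under the notation of Section 3.1."""
    a = n * (p - 2) ** 2 / (p * (n + 2) ** 2 * w)    # a(W) > 0 for James-Stein
    gamma = n * (1 + w) / (n + p + 2)
    psi0 = max(min(1.0, 1.0 / a), (1.0 - gamma) / a)
    r0 = p * s / n * (1.0 - a)
    r_tr = max(0.0, r0)
    r_psi0 = p * s / n * (1.0 - a * psi0)
    return r0, r_tr, r_psi0


def b_is_empty_js(p, n):
    """Sufficient condition quoted in the text for B to be empty."""
    disc = ((p + 2) / n) ** 2 - 4 * (n + p + 2) / p * ((p - 2) / (n + 2)) ** 2
    return disc < 0


r0_a, rtr_a, rpsi_a = mse_estimators_js(w=0.05, s=1.0, p=5, n=5)   # UMVUE negative
r0_b, rtr_b, rpsi_b = mse_estimators_js(w=0.7, s=1.0, p=5, n=5)    # W inside B
```

With $(p, n) = (5, 5)$ the set $B$ is nonempty, so at $W = 0.7$ the $\psi_0$-estimator is strictly smaller than the truncated UMVUE, whereas for larger dimensions such as $(p, n) = (20, 20)$ the discriminant condition holds and the two estimators coincide.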

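Corollary 1. If $(p, n)$ satisfies $B \ne \emptyset$, $\hat R(\psi_0; \delta_\phi)$ improves on $\hat R^{TR}(\delta_\phi)$ under the loss
(12).

Proof. As mentioned above, $\hat R^{TR}(\delta_\phi)$ is identical to $\hat R(\psi; \delta_\phi)$ with

$$\psi(W) = \begin{cases} \min\left(1, \dfrac{1}{a(W)}\right), & \text{if } W \in A, \\[2mm] 1, & \text{otherwise}. \end{cases}$$

By following the proof of Theorem 1, we can see that $\hat R(\psi_0; \delta_\phi)$ improves on $\hat R^{TR}(\delta_\phi)$
under the loss (12) when $B \ne \emptyset$. □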

We present in Figure 1 the shape of $1 - a(W)\psi_0(W)$ for $\delta_{JS}$ and $\delta_{JS}^{+}$ with $(p, n) = (5, 5)$.
The dotted line and the solid line represent $1 - a(W)$ and $1 - a(W)\psi_0(W)$, respectively.
We can see that when $\hat R_0(\delta_\phi) < 0$, $\hat R(\psi_0; \delta_\phi)$ shrinks it towards 0, and even when $\hat R_0(\delta_\phi) > 0$,
$\hat R(\psi_0; \delta_\phi)$ slightly shrinks it towards 0.

Figure 1: $1 - a(W)\psi_0(W)$ of $\delta_{JS}$ (left panel) and $\delta_{JS}^{+}$ (right panel) when $(p, n) = (5, 5)$



3.2 Improved estimation of the risk reduction and positive estimation of MSE

In the previous section we have shown that $\hat R(\psi_0; \delta_\phi)$ improves on $\hat R_0(\delta_\phi)$. However, we
can easily see that $\hat R(\psi_0; \delta_\phi)$ can take zero with positive probability. From a practical
viewpoint, $\hat R(\psi_0; \delta_\phi)$ may still be undesirable. In order to obtain positive estimators, we
take an approach similar to the one in Kubokawa and Srivastava [13] and consider the
improved estimation of the risk reduction $R^*(\delta_\phi) = p\sigma^2 - R(\delta_\phi)$. In accordance with
Kubokawa and Srivastava [13], for evaluating the estimator $\hat R^*(\delta_\phi)$ we use the following
quadratic loss function,

$$L(\hat R^*(\delta_\phi); R^*(\delta_\phi)) = (\hat R^*(\delta_\phi) - R^*(\delta_\phi))^2. \tag{22}$$

From (13), the UMVUE of $R^*(\delta_\phi)$ is $\hat R_0^*(\delta_\phi) = pS\,a(W)/n$. In order to improve on $\hat R_0^*(\delta_\phi)$,
we introduce the class of estimators $\hat R^*(\psi; \delta_\phi) = pS\,a(W)\psi(W)/n$. We can see that $pS/n -
\hat R^*(\psi; \delta_\phi)$ is identical to $\hat R(\psi; \delta_\phi)$ defined in (14). It is well known that $R^*(\delta_\phi)/\sigma^2$ depends
only on $\lambda$ (e.g. Efron and Morris [6]). We suppose that $R^*(\delta_\phi)/\sigma^2$ is maximized at $\lambda = 0$. If
$\phi(W)/W$ is nonincreasing, $\delta_\phi$ satisfies this condition (e.g. Kubokawa [11], Casella [5]). Many
shrinkage estimators including $\delta_{JS}$ and $\delta_{JS}^{+}$ satisfy this condition. Denote $R^*(\delta_\phi)/\sigma^2$ with
$\lambda = 0$ by $\alpha_{p,n}(\delta_\phi)$. By using an argument similar to that of Theorem 1, we can obtain the
following result.

Theorem 2. Let $W_{p,n}(\delta_\phi)$ be the solution of the equation

$$\frac{1+W}{a(W)} = \frac{p(n+p+2)}{n} \cdot \frac{1}{\alpha_{p,n}(\delta_\phi)}. \tag{23}$$

Define $\psi_1(W)$ by

$$\psi_1(W) = \min\left\{ 1,\; \frac{\alpha_{p,n}(\delta_\phi)}{p} \cdot \frac{n(1 + W_{p,n}(\delta_\phi))}{n+p+2} \cdot \frac{1}{a(W)} \right\}.$$

Then $\hat R^*(\psi_1; \delta_\phi)$ improves on $\hat R_0^*(\delta_\phi)$.
The proof is similar to the proof of Theorem 1 and is given in the Appendix.
By using an argument similar to that in Kubokawa and Srivastava [13], we can also obtain
the following result.

Theorem 3. Suppose that

(i) $a(W)$ is nonincreasing;

(ii) $\psi(W)$ is nondecreasing and $\lim_{W \to \infty} \psi(W) = 1$;

(iii) $\psi(W) \ge \psi_2(W)$, where

$$\psi_2(W) = \min\left(1, \frac{\alpha_{p,n}(\delta_\phi)}{p\,a(W)}\right).$$

Then $\hat R^*(\psi; \delta_\phi)$ improves on $\hat R_0^*(\delta_\phi)$ under the loss (22). In particular, $\hat R^*(\psi_2; \delta_\phi)$ improves
on $\hat R_0^*(\delta_\phi)$.

The proof of this theorem is also given in the Appendix. We note that we can see
from (6) and (9) that $\delta_{JS}$ and $\delta_{JS}^{+}$ satisfy the condition (i). The results of Theorem 2
and Theorem 3 lead us to propose the following estimators of $R(\delta_\phi)$,

$$\hat R(\psi_1; \delta_\phi) = \frac{pS}{n} - \hat R^*(\psi_1; \delta_\phi) = \max\left\{ \frac{pS}{n}(1 - a(W)),\; \frac{pS}{n}\left(1 - \frac{n(1 + W_{p,n}(\delta_\phi))}{n+p+2} \cdot \frac{\alpha_{p,n}(\delta_\phi)}{p}\right) \right\},$$

$$\hat R(\psi_2; \delta_\phi) = \frac{pS}{n} - \hat R^*(\psi_2; \delta_\phi) = \max\left\{ \frac{pS}{n}(1 - a(W)),\; \frac{pS}{n}\left(1 - \frac{\alpha_{p,n}(\delta_\phi)}{p}\right) \right\}.$$

Since $\alpha_{p,n}(\delta_\phi) < p$, $\hat R(\psi_2; \delta_\phi)$ is always positive. If $p$ and $n$ satisfy

$$\gamma_{p,n}(\delta_\phi) = \frac{n(1 + W_{p,n}(\delta_\phi))}{n+p+2} < \frac{p}{\alpha_{p,n}(\delta_\phi)}, \tag{24}$$

$\hat R(\psi_1; \delta_\phi) > 0$ with probability one. We can prove $\gamma_{p,n}(\delta_{JS}) < p/\alpha_{p,n}(\delta_{JS})$ analytically.
The proof is given in the Appendix.

We present $\gamma_{p,n}(\delta_{JS})$, $\gamma_{p,n}(\delta_{JS}^{+})$, $W_{p,n}(\delta_{JS})$ and $W_{p,n}(\delta_{JS}^{+})$ for $(p, n) = (5, 5)$, $(10, 5)$,
$(5, 10)$ and $(10, 10)$ in Tables 1 and 2. We can see $\gamma_{p,n}(\delta_{JS}) < 1$ and $\gamma_{p,n}(\delta_{JS}^{+}) < 1$ for
all cases. Since $p/\alpha_{p,n}(\delta_\phi) > 1$, $\gamma_{p,n}(\delta_{JS})$ and $\gamma_{p,n}(\delta_{JS}^{+})$ satisfy (24). Many shrinkage
estimators seem to satisfy (24). However, it seems to be difficult to prove (24) for general
$\delta_\phi$ analytically. We note that $\gamma_{p,n}(\delta_{JS}) < 1$ and $\gamma_{p,n}(\delta_{JS}^{+}) < 1$ mean that $\psi_1(\cdot)$ does not
satisfy the conditions in Theorem 3. Hence from Table 1, $\hat R(\psi_1; \delta_\phi)$ is not necessarily
included in the class of estimators in Theorem 3.
Table 1: $\gamma_{p,n}(\delta_{JS})$ and $\gamma_{p,n}(\delta_{JS}^{+})$ for some p and n

(p,n)                           (5,5)     (10,5)    (5,10)    (10,10)
$\gamma_{p,n}(\delta_{JS})$       0.6795    0.7452    0.7774    0.8228
$\gamma_{p,n}(\delta_{JS}^{+})$   0.6399    0.7056    0.7484    0.7921


Table 2: $W_{p,n}(\delta_{JS})$ and $W_{p,n}(\delta_{JS}^{+})$ for some p and n

(p,n)                           (5,5)     (10,5)    (5,10)    (10,10)
$W_{p,n}(\delta_{JS})$            0.6307    1.533     0.3216    0.8102
$W_{p,n}(\delta_{JS}^{+})$        0.5357    1.399     0.2722    0.7427
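The James-Stein entries of Tables 1 and 2 can be checked by solving equation (23) in closed form. The sketch below assumes two facts that are not spelled out at this point in the text: that $a(W) = n(p-2)^2/(p(n+2)^2 W)$ for $\delta_{JS}$, and that the risk reduction at $\lambda = 0$ is $\alpha_{p,n}(\delta_{JS}) = n(p-2)/(n+2)$, which follows from $E[1/\chi^2_p] = 1/(p-2)$. The function name `w_gamma_js` is illustrative.

```python
import math


def w_gamma_js(p, n):
    """Solve (23) for the James-Stein estimator and return (W_pn, gamma_pn)."""
    alpha = n * (p - 2) / (n + 2)                   # assumed alpha_{p,n}(delta_JS)
    k = n * (p - 2) ** 2 / (p * (n + 2) ** 2)       # a(W) = k / W
    rhs = p * (n + p + 2) / (n * alpha)
    # (1 + W) / a(W) = rhs  <=>  W^2 + W - k * rhs = 0; take the positive root.
    w = (-1.0 + math.sqrt(1.0 + 4.0 * k * rhs)) / 2.0
    gamma = n * (1.0 + w) / (n + p + 2)
    return w, gamma


results = {pn: w_gamma_js(*pn) for pn in [(5, 5), (10, 5), (5, 10), (10, 10)]}
```

Under these assumptions the computed values agree with the tabulated James-Stein figures to the reported precision, for example $W \approx 0.6307$ and $\gamma \approx 0.6795$ for $(p, n) = (5, 5)$.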



So far we considered the improved estimation of $R^*(\delta_\phi)$. Next we evaluate
the resulting estimators $\hat R(\psi_1; \delta_\phi)$ and $\hat R(\psi_2; \delta_\phi)$ as estimators of $R(\delta_\phi)$. Based on the
fact that (21) is a necessary condition for $\hat R(\psi; \delta_\phi)$ to be admissible in the class (14), we
can provide estimators improving on $\hat R(\psi_1; \delta_\phi)$ and $\hat R(\psi_2; \delta_\phi)$ under the loss (12).

Theorem 4. Suppose that $B \ne \emptyset$. Let $\psi_1^{TR}(W)$ and $\psi_2^{TR}(W)$ be defined by

$$\psi_j^{TR}(W) = \max\left\{ \psi_j(W),\; \frac{1}{a(W)}\left(1 - \frac{n(1+W)}{n+p+2}\right) \right\}, \qquad j = 1, 2.$$

Then $\hat R(\psi_j^{TR}; \delta_\phi)$ improves on $\hat R(\psi_j; \delta_\phi)$ under the loss (12).

Based on the proof of Theorem 1, the proof is easy and omitted. It is clear that
$\hat R(\psi_j^{TR}; \delta_\phi) > 0$ when $\hat R(\psi_j; \delta_\phi) > 0$. The dominance relation between $\hat R(\psi_j^{TR}; \delta_\phi)$ and
$\hat R_0(\delta_\phi)$ is interesting. But it seems difficult to clarify it at this point.

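4 Improved estimators of the MSE matrix

4.1 An improved nonnegative definite estimation of the MSE matrix

In this section we provide a nonnegative definite estimator of $M(\delta_\phi)$ improving on
$\hat M_0(\delta_\phi)$. For evaluating an estimator $\hat M(\delta_\phi)$, we use the following squared loss function,

$$L(\hat M(\delta_\phi); M(\delta_\phi)) = \mathrm{tr}\,(\hat M(\delta_\phi) - M(\delta_\phi))^2. \tag{25}$$

We note that $L(\hat M(\delta_\phi); M(\delta_\phi))$ is the sum of quadratic losses of each element. In order
to consider the improvement of $\hat M_0(\delta_\phi)$, we introduce the following class of estimators
with functions $\xi(\cdot)$ and $\eta(\cdot)$,

$$\hat M(\xi, \eta; \delta_\phi) = S \cdot \Gamma' \left\{ \left(\frac{1}{n} - g_1(W)\xi(W)\right)(I_p - E_{11}) + \left(\frac{1}{n} + g_3(W) - g_1(W)\eta(W)\right) E_{11} \right\} \Gamma = S \cdot \Gamma' L(\xi, \eta; \delta_\phi)\, \Gamma, \tag{26}$$

where

$$L(\xi, \eta; \delta_\phi) = \mathrm{diag}\bigl(l_0(\eta; \delta_\phi), l_1(\xi; \delta_\phi), \ldots, l_1(\xi; \delta_\phi)\bigr),$$

$$l_0(\eta; \delta_\phi) = \frac{1}{n} + g_3(W) - g_1(W)\eta(W), \qquad l_1(\xi; \delta_\phi) = \frac{1}{n} - g_1(W)\xi(W).$$

It is clear that $\hat M(\xi, \eta; \delta_\phi)$ with $\xi(W) = 1$ and $\eta(W) = 1$ is identical to $\hat M_0(\delta_\phi)$.
$\hat M(\xi, \eta; \delta_\phi)$ with $\eta(W) = 1$ coincides with the class which Kubokawa and Srivastava [13]
considered for the James-Stein estimator. In the case of the positive-part Stein estimator,
however, $g_3(W) - g_1(W)$ is written, from (8) and (11), as

$$g_3(W) - g_1(W) = \begin{cases} W - \dfrac{2}{n} + (C_1 + C_2)\, W^{n/2}, & W < \dfrac{p-2}{n+2}, \\[2mm] \dfrac{p(p-2)}{(n+2)^2} \cdot \dfrac{1}{W}, & \text{otherwise}. \end{cases}$$

Denote $l_0(\eta; \delta_{JS}^{+})$ with $\eta(W) = 1$ by $l_0(\delta_{JS}^{+})$. Then we can see that $l_0(\delta_{JS}^{+}) < 0$ for small
$W$. This means that any estimator $\hat M(\xi, \eta; \delta_{JS}^{+})$ with $\eta(W) = 1$ is not always
nonnegative definite. So we consider finding improved estimators from the class (26).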

We assume that gi(W) is nonincreasing. From (8) and (11), we can see that 8js and 
8~j S satisfy this condition. Let u and v be random variables which distribute as chi-square 
distribution with p + 2j and n degrees of freedom, respectively. Then define (3p)i(8$) 

3>0 



where 



2{p-l)(j>{u/v) (p + 2j- l)b(u/v) 
u/v 

[n + 2)(j) 2 (W) 



p + 2j 
> )'{W)-A(j){W)(j)\W). 



W W 

Figure 2 represents the behavior of $\beta^{(1)}_{p,n}(j; \delta_\phi)$ of $\delta_{JS}$ and $\delta_{JS}^{+}$ with $(p, n) = (5, 5)$, $(10, 10)$
computed by Monte Carlo simulation. The result of Figure 2 indicates that $\beta^{(1)}_{p,n}(\delta_{JS}) > 0$
and $\beta^{(1)}_{p,n}(\delta_{JS}^{+}) > 0$. We obtained the same results numerically with respect to other $(p, n)$
and other $\delta_\phi$.

Figure 2: $\beta^{(1)}_{p,n}(j; \delta_\phi)$ of $\delta_{JS}$ and $\delta_{JS}^{+}$ with $(p, n) = (5, 5)$ (panel (i)) and $(10, 10)$ (panel (ii))

By using an argument similar to that in the proof of Theorem 1, we can obtain the following
theorem.

Theorem 5. Suppose that f3p]l(8,p) > 0. Let £o(W) and r] (W) be defined by 

1 + W 



Z,o(W) = max 
r)o(W) = min 



min 1, 



ngiiW) J ' gi(W) \n n + p + 1 



"' si(w0 V™ 

JTien AiT(Co, ?7o; 8$) improves on M (^) under the loss (25). 

The proof is similar to the one of Theorem 1 and is given in the Appendix. By following 
the proof of the theorem and by using the same argument in Section 3.1, we can see that 



9i(W) \n n + p + 2 



1+MM <ew< 1 



n 9l (WY 



(27) 



is a necessary condition on M(£,r); 8<j>) to be admissible in the class (26). 

We can easily see that $\hat M(\xi_0, \eta_0; \delta_\phi)$ is always nonnegative definite. However, $l_1(\xi_0; \delta_\phi) = 0$
when $\xi_0(W) = 1/(n g_1(W))$ and $l_0(\eta_0; \delta_\phi) = 0$ when $\eta_0(W) = (1/n + g_3(W))/g_1(W)$, i.e.
$\hat M(\xi_0, \eta_0; \delta_\phi)$ is not always positive definite. As mentioned in the previous section, from a
practical viewpoint, the inverse matrices of estimators of the MSE matrix are required to
form a confidence set as in (3). In this sense $\hat M(\xi_0, \eta_0; \delta_\phi)$ may still be undesirable. In the
next section we provide some positive definite estimators by considering the improvement
of the UMVUE of the risk reduction matrix $M^*(\delta_\phi)$.

4.2 An improved estimation of the risk reduction matrix and positive-definite estimation of the MSE matrix

Following the argument in Kubokawa and Srivastava [13], we consider the estimation of
MSE reduction matrices $M^*(\delta_\phi) = \sigma^2 I_p - M(\delta_\phi)$. As a criterion, we use the following
quadratic loss function,

$$L(\hat M^*(\delta_\phi); M^*(\delta_\phi)) = \mathrm{tr}\,(\hat M^*(\delta_\phi) - M^*(\delta_\phi))^2. \tag{28}$$

We consider the following class derived from (26),

$$\hat M^*(\xi, \eta; \delta_\phi) = \frac{S}{n} I_p - \hat M(\xi, \eta; \delta_\phi) = S \cdot \Gamma' \left\{ g_1(W)\xi(W) \cdot (I_p - E_{11}) - \bigl(g_3(W) - g_1(W)\eta(W)\bigr) \cdot E_{11} \right\} \Gamma = S \cdot \Gamma' L^*(\xi, \eta; \delta_\phi)\, \Gamma, \tag{29}$$

where

$$L^*(\xi, \eta; \delta_\phi) = \mathrm{diag}\bigl(l_0^*(\eta; \delta_\phi), l_1^*(\xi; \delta_\phi), \ldots, l_1^*(\xi; \delta_\phi)\bigr),$$

$$l_0^*(\eta; \delta_\phi) = g_1(W)\eta(W) - g_3(W) = \frac{1}{n} - l_0(\eta; \delta_\phi),$$

$$l_1^*(\xi; \delta_\phi) = g_1(W)\xi(W) = \frac{1}{n} - l_1(\xi; \delta_\phi).$$
Let Uj v and b(W) be defined as in the previous section. Define (3^1(8^) by 

:=sup^)(j;^), 
i>o 

where 



u/v P + 2j 

Figure 3 represents the behavior of $\beta^{(2)}_{p,n}(j; \delta_{JS})$ and $\beta^{(2)}_{p,n}(j; \delta_{JS}^{+})$ with $(p, n) = (5, 5)$,
$(10, 10)$ computed by Monte Carlo simulation. Table 3 presents $\beta^{(2)}_{p,n}(\delta_{JS})$ and $\beta^{(2)}_{p,n}(\delta_{JS}^{+})$
when $(p, n)$ is $(5, 5)$, $(5, 10)$, $(10, 5)$ and $(10, 10)$. Kubokawa and Srivastava [13] proved
that $\sup_{j \ge 0} \beta^{(2)}_{p,n}(j; \delta_{JS})$ is attained at $j = 0, 1$. They also showed numerically that
$\sup_{j \ge 0} \beta^{(2)}_{p,n}(j; \delta_{JS}^{+})$ is attained at $j = 0$. The result of Figure 3 is consistent with their
results.

Similar to the argument in Theorem 5, we can obtain the following result.
Theorem 6. Let the solutions of the equations 

n + p + 2 9l (W) ' gi (W)\ m ) n + p + 2 Pp ' n{ 
Figure 3: $\beta^{(2)}_{p,n}(j; \delta_\phi)$ of $\delta_{JS}$ and $\delta_{JS}^{+}$ with $(p, n) = (5, 5)$ (panel (i)) and $(10, 10)$ (panel (ii))

Table 3: $\beta^{(2)}_{p,n}(\delta_{JS})$ and $\beta^{(2)}_{p,n}(\delta_{JS}^{+})$ for some p and n

(p,n)                                (5,5)     (10,5)    (5,10)    (10,10)
$\beta^{(2)}_{p,n}(\delta_{JS})$       0.4260    0.5704    0.5008    0.6718
$\beta^{(2)}_{p,n}(\delta_{JS}^{+})$   0.5332    0.6611    0.5173    0.7616

be denoted by W^ n (S^) and W^ n {8^), respectively. Define £i(W) and 6?/ 



ci(VV) = mm 1, - ■ 777T— , 

K ' V n + p + 2 gi (W) J 



i]i(W) = min 



1 + W^jSt) (2) 



' ^i(W0 V n + p + 2 

TTien Af* (£1,771; <5^) improves on M"q(5^) under the loss (28). 

The proof of this theorem is given in the Appendix. By using a procedure similar
to that of Kubokawa and Srivastava [13], we can also obtain the following theorem.

Theorem 7. Suppose that

(i) $g_1(W)$ is nonincreasing and $g_3(W)/g_1(W)$ is nondecreasing;

(ii) $\xi(\cdot)$ and $\eta(\cdot)$ are nondecreasing and satisfy

$$\lim_{W \to \infty} \xi(W) = 1, \qquad \lim_{W \to \infty} \eta(W) = 1;$$

(iii) $\xi(W) \ge \xi_2(W)$ and $\eta(W) \ge \eta_2(W)$, where

$$\xi_2(W) = \min\left\{ 1,\; \frac{1}{g_1(W)} \cdot \frac{\beta^{(2)}_{p,n}(\delta_\phi)}{n+2} \right\},$$

$$\eta_2(W) = \min\left\{ 1,\; \frac{1}{g_1(W)} \left( g_3(W) + \frac{\beta^{(2)}_{p,n}(\delta_\phi)}{n+2} \right) \right\}.$$

Then $\hat M^*(\xi, \eta; \delta_\phi)$ improves on $\hat M_0^*(\delta_\phi)$ under the loss (28).

The proof of this theorem is also given in the Appendix. We can easily see from (8) and
(11) that the condition (i) is satisfied for $\delta_{JS}$ and $\delta_{JS}^{+}$. We note that if $g_3(W) - g_1(W) > 0$
for all $W > 0$, $\eta_1(W) = 1$ and $\eta_2(W) = 1$ with probability one.

The results of Theorem 6 and Theorem 7 lead us to propose the estimators of the
MSE matrix $M(\delta_\phi)$ as

$$\hat M(\xi_i, \eta_i; \delta_\phi) = \frac{S}{n} I_p - \hat M^*(\xi_i, \eta_i; \delta_\phi), \qquad i = 1, 2.$$



If (p, n) satisfies 



£ / c \ —\~ • ■ f p,n\- h>j /i~v><*\- vi ^ -, 

^ M '= + 2 ^ x ' 

n(l + WgJ^))/#>(* ) 
^ M '= ~n~+p~+2 £ h 

$\hat M(\xi_1, \eta_1; \delta_\phi)$ is always positive definite, and if $(p, n)$ satisfies

$$\beta^{(2)}_{p,n}(\delta_\phi)/(n+2) < 1/n,$$

$\hat M(\xi_2, \eta_2; \delta_\phi)$ is always positive definite. Table 4 presents $\gamma^{(1)}_{p,n}(\delta_{JS})$, $\gamma^{(1)}_{p,n}(\delta_{JS}^{+})$ and $\gamma^{(2)}_{p,n}(\delta_{JS}^{+})$ when
$(p, n)$ is $(5, 5)$, $(5, 10)$, $(10, 5)$ and $(10, 10)$. From Tables 3 and 4, we can see that the above
conditions are satisfied for the $(p, n)$ in the tables. In Table 5 we also present $W^{(1)}_{p,n}(\delta_{JS})$,
$W^{(1)}_{p,n}(\delta_{JS}^{+})$ and $W^{(2)}_{p,n}(\delta_{JS}^{+})$ for the same $(p, n)$ as in Table 4.

Recall the fact that (27) is a necessary condition for $\hat M(\xi, \eta; \delta_\phi)$ to be admissible in the
class (26). By using an argument similar to that in Theorem 3, we can also obtain estimators
improving on $\hat M(\xi_i, \eta_i; \delta_\phi)$, $i = 1, 2$, under the loss (25).



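Table 4: $\gamma^{(1)}_{p,n}(\delta_{JS})$, $\gamma^{(1)}_{p,n}(\delta_{JS}^{+})$ and $\gamma^{(2)}_{p,n}(\delta_{JS}^{+})$

(p,n)                                 (5,5)     (10,5)    (5,10)    (10,10)
$\gamma^{(1)}_{p,n}(\delta_{JS})$       0.4312    0.6143    0.5273    0.7548
$\gamma^{(1)}_{p,n}(\delta_{JS}^{+})$   0.4963    0.6690    0.6102    0.8170
$\gamma^{(2)}_{p,n}(\delta_{JS}^{+})$   0.2708    0.2567    0.4130    0.4014


Table 5: $W^{(1)}_{p,n}(\delta_{JS})$, $W^{(1)}_{p,n}(\delta_{JS}^{+})$ and $W^{(2)}_{p,n}(\delta_{JS}^{+})$

(p,n)                                 (5,5)     (10,5)    (5,10)    (10,10)
$W^{(1)}_{p,n}(\delta_{JS})$            1.4198    2.6577    0.7901    1.472
$W^{(1)}_{p,n}(\delta_{JS}^{+})$        1.2336    2.4405    0.6829    1.360
$W^{(2)}_{p,n}(\delta_{JS}^{+})$        0.2185    0.3202    0.1391    0.1596
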
Theorem 8. Suppose that $\beta^{(1)}_{p,n}(\delta_\phi) > 0$. Let $\xi_i^{TR}(W)$ for $i = 1, 2$ be defined by
g R (W) = max 

Then if 

I siW V n n + p + ly J 

^{iJ R ^ r li'^<j>) improves on M(^i,rji] S^) under the loss (25). 

The proof of this theorem is easy and omitted. It is obvious that when $\hat M(\xi_i, \eta_i; \delta_\phi)$ is
positive definite, $\hat M(\xi_i^{TR}, \eta_i; \delta_\phi)$ is also positive definite. The dominance relation between
$\hat M(\xi_i^{TR}, \eta_i; \delta_\phi)$ and $\hat M_0(\delta_\phi)$ under the loss (25) is interesting. As in the case
of the MSE, however, it seems difficult to clarify it at this point.

5 Monte Carlo studies 

5.1 Risk performance of proposed estimators 

In this section we study the risk performance of the proposed estimators of the MSE and
the MSE matrix for the positive-part Stein estimator through Monte Carlo studies with
100,000 replications. Figure 4 represents the risks $E[L(\hat R(\delta_{JS}^{+}); R(\delta_{JS}^{+}))]$ of
$\hat R_0(\delta_{JS}^{+})$, $\hat R(\psi_0; \delta_{JS}^{+})$, $\hat R(\psi_1^{TR}; \delta_{JS}^{+})$ and $\hat R(\psi_2^{TR}; \delta_{JS}^{+})$. Figure 5 represents the
risks $E[L(\hat M(\delta_{JS}^{+}); M(\delta_{JS}^{+}))]$ of $\hat M_0(\delta_{JS}^{+})$, $\hat M(\xi_0, \eta_0; \delta_{JS}^{+})$, $\hat M(\xi_1^{TR}, \eta_1; \delta_{JS}^{+})$ and $\hat M(\xi_2^{TR}, \eta_2; \delta_{JS}^{+})$.
We set $\theta$, $\sigma^2$, $\lambda$, $p$ and $n$ as

• $\theta = (\sqrt{\lambda/p}, \ldots, \sqrt{\lambda/p})'$ ;

• $\sigma^2 = 1$ ;

• $0 \le \lambda \le 30$ ;

• $(p, n) = (5, 5)$, $(10, 5)$, $(5, 10)$ and $(10, 10)$.
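A scaled-down version of such an experiment can be sketched with the standard library alone. The sketch below is not the article's experiment: it uses the James-Stein estimator instead of the positive-part estimator (so that only the closed-form expression (6) is needed), 20,000 rather than 100,000 replications, and a simulated average of the squared error as a proxy for the true MSE. The name `simulate_risks` and the seed are illustrative.

```python
import random


def simulate_risks(p, n, lam, reps=20000, seed=1):
    """Monte Carlo risks under the squared loss (12) for three estimators of
    the MSE of the James-Stein estimator, with sigma^2 = 1."""
    rng = random.Random(seed)
    theta = [(lam / p) ** 0.5] * p              # ||theta||^2 = lam
    c = (p - 2) / (n + 2)
    draws, loss_sum = [], 0.0
    for _ in range(reps):
        x = [t + rng.gauss(0.0, 1.0) for t in theta]
        s = rng.gammavariate(n / 2, 2.0)        # chi-square with n d.f.
        w = sum(v * v for v in x) / s
        loss_sum += sum(((1 - c / w) * v - t) ** 2 for v, t in zip(x, theta))
        draws.append((w, s))
    true_mse = loss_sum / reps                  # simulation proxy for R(delta_JS)
    risks = {"umvue": 0.0, "truncated": 0.0, "psi0": 0.0}
    for w, s in draws:
        r0 = p * s / n - c ** 2 * s / w         # UMVUE, eq. (6)
        r_tr = max(0.0, r0)                     # truncated UMVUE
        r_psi0 = min(r_tr, p * s * (1 + w) / (n + p + 2))
        risks["umvue"] += (r0 - true_mse) ** 2 / reps
        risks["truncated"] += (r_tr - true_mse) ** 2 / reps
        risks["psi0"] += (r_psi0 - true_mse) ** 2 / reps
    return true_mse, risks


true_mse, risks = simulate_risks(p=5, n=5, lam=1.0)
```

Because the target is nonnegative, truncation at zero can only reduce the squared error in every replication, so the simulated risk of the truncated estimator never exceeds that of the UMVUE; the comparison with the $\psi_0$-estimator is where the simulation carries information.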
The summary of experiments is as follows. 

• We can see from the figures that every proposed estimator uniformly improves on
the UMVUE.

• In Section 3, we have shown that $\hat R(\psi_0; \delta_{JS}^{+})$ improves on $\hat R_0(\delta_{JS}^{+})$ under the loss (12).
We could not give theoretical proofs that $\hat R(\psi_1^{TR}; \delta_{JS}^{+})$ and $\hat R(\psi_2^{TR}; \delta_{JS}^{+})$
improve on $\hat R_0(\delta_{JS}^{+})$ under the loss (12) as estimators of the MSE. However, we can
see from Figure 4 that $\hat R(\psi_1^{TR}; \delta_{JS}^{+})$ and $\hat R(\psi_2^{TR}; \delta_{JS}^{+})$ seem to uniformly improve on
$\hat R_0(\delta_{JS}^{+})$ under the loss (12).



9i(W) 



n 



j-W_ 
p + 1 
Figure 4: Estimated risks of the UMVUE and proposed estimators of $R(\delta_{JS}^{+})$; panels for $(p, n) = (5, 5)$, $(10, 5)$, $(5, 10)$ and $(10, 10)$

Figure 5: Estimated risks for the UMVUE and proposed estimators of $M(\delta_{JS}^{+})$; panels for $(p, n) = (5, 5)$, $(10, 5)$, $(5, 10)$ and $(10, 10)$

• Also in the case of estimators of the MSE matrix, we can see from Figure 5 that
$\hat M(\xi_1^{TR}, \eta_1; \delta_{JS}^{+})$ and $\hat M(\xi_2^{TR}, \eta_2; \delta_{JS}^{+})$, which are derived from improved estimators of
$M^*(\delta_{JS}^{+})$, seem to improve on $\hat M_0(\delta_{JS}^{+})$ under the loss (25).

• When $\lambda$ is small, the proposed estimators show a large rate of risk reduction. Conversely,
as $\lambda$ gets larger, the rate of risk reduction of the proposed estimators gets smaller.



5.2 Confidence sets centered at shrinkage estimators 

In this section we study the performance of confidence sets (3) centered at the positive-part
Stein estimator. Let $c = F_{0.95}(p, n)$ be the 95th percentile of the F distribution with degrees
of freedom $p$ and $n$. Then the conventional 95% confidence set of $\theta$ is written as

$$C_0 : Q_0 = \frac{(X - \theta)'(X - \theta)/p}{S/n} \le c.$$

In what follows, we write $M_1 = \hat M(\xi_1^{TR}, \eta_1; \delta_{JS}^{+})$ and $M_2 = \hat M(\xi_2^{TR}, \eta_2; \delta_{JS}^{+})$ for notational
simplicity. Following Carter et al. [4] and Wan et al. [17], we first consider the following
three alternatives to $C_0$,

$$C_1 : Q_1 = \frac{(\delta_{JS}^{+} - \theta)' M_1^{-1} (\delta_{JS}^{+} - \theta)}{p} \le c,$$

$$C_2 : Q_2 = \frac{(\delta_{JS}^{+} - \theta)' M_2^{-1} (\delta_{JS}^{+} - \theta)}{p} \le c,$$

$$C_3 : Q_3 = \frac{(\delta_{JS}^{+} - \theta)'(\delta_{JS}^{+} - \theta)/p}{S/n} \le c.$$
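The comparison between $C_0$ and $C_3$ is easy to sketch, since both are spheres of the same radius and differ only in their center. The code below is a small illustration for $(p, n) = (5, 5)$ using the standard library only; the constant 5.0503 is the tabulated 95% point of the F distribution with (5, 5) degrees of freedom, and the function name `coverage`, the seed and the replication count are assumptions made for this example. The sets $C_1$ and $C_2$ are not included because they require the estimators $M_1$ and $M_2$.

```python
import random

F95_5_5 = 5.0503   # tabulated 95% point of F(5, 5)


def coverage(lam, p=5, n=5, c=F95_5_5, reps=40000, seed=7):
    """Simulated coverage of C_0 (centered at X) and C_3 (centered at the
    positive-part Stein estimator), with sigma^2 = 1."""
    rng = random.Random(seed)
    theta = [(lam / p) ** 0.5] * p
    k = (p - 2) / (n + 2)
    hit0 = hit3 = 0
    for _ in range(reps):
        x = [t + rng.gauss(0.0, 1.0) for t in theta]
        s = rng.gammavariate(n / 2, 2.0)         # chi-square with n d.f.
        w = sum(v * v for v in x) / s
        f = max(0.0, 1.0 - k / w)                # positive-part shrinkage factor
        radius_sq = p * c * s / n                # Q <= c  <=>  ||center - theta||^2 <= p c S / n
        hit0 += sum((v - t) ** 2 for v, t in zip(x, theta)) <= radius_sq
        hit3 += sum((f * v - t) ** 2 for v, t in zip(x, theta)) <= radius_sq
    return hit0 / reps, hit3 / reps


cov0, cov3 = coverage(lam=0.0)
```

At $\lambda = 0$ the shrunken center is never farther from $\theta$ than $X$ is, so the simulated coverage of $C_3$ cannot fall below that of $C_0$, which itself should be close to the nominal 95%.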



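Let $\mathrm{Vol}(C_i)$ denote the volume of $C_i$, i.e.

$$\mathrm{Vol}(C_0) = \mathrm{Vol}(C_3) = \frac{(\pi p c S/n)^{p/2}}{\Gamma(p/2 + 1)}, \qquad \mathrm{Vol}(C_i) = \frac{(\pi p c)^{p/2}\, |M_i|^{1/2}}{\Gamma(p/2 + 1)}, \quad i = 1, 2.$$

Define $V_1$ and $V_2$ by

$$V_1 = \frac{E[\mathrm{Vol}(C_1)]}{E[\mathrm{Vol}(C_0)]} = \frac{E[\mathrm{Vol}(C_1)]}{E[\mathrm{Vol}(C_3)]}, \qquad V_2 = \frac{E[\mathrm{Vol}(C_2)]}{E[\mathrm{Vol}(C_0)]} = \frac{E[\mathrm{Vol}(C_2)]}{E[\mathrm{Vol}(C_3)]}.$$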



Figure 6 presents the coverage probabilities of $C_0$ to $C_3$, and $V_1$ and $V_2$, computed through
the Monte Carlo method with 100,000 replications. We set $\theta$, $\sigma^2$, $\lambda$, $p$ and $n$ as in the experiments
in the previous section. From the figures, we can see that when $\lambda$ is small, the coverage
probabilities of $C_1$ to $C_3$ are larger than that of $C_0$. When $(p, n)$ is $(5, 5)$ and $(10, 5)$, the
coverage probabilities of $C_1$ to $C_3$ are larger than 95% for $0 \le \lambda \le 30$. When $(p, n)$ is
$(5, 10)$ and $(10, 10)$, however, the coverage probabilities of $C_1$ and $C_2$ drop to less than
95% as $\lambda$ gets larger. The expected volumes of $C_1$ and $C_2$ are far smaller than those of
$C_0$ and $C_3$. These results suggest that when $\lambda$ and $n$ are small, $C_1$ and $C_2$ seem to have
higher coverage probabilities and smaller volumes than $C_0$. On the other hand, while the
expected volume of $C_3$ is larger than that of $C_1$ and $C_2$, the coverage probability of $C_3$
is always larger than 95%. In this sense, $C_3$ may be the most practical among $C_0$ to $C_3$.

Figure 6: Coverage probabilities $\Pr(Q_i \le c)$ of $C_0$ to $C_3$, and $V_1$ and $V_2$; panels for $(p, n) = (5, 5)$, $(10, 5)$, $(5, 10)$ and $(10, 10)$

Figure 7: Coverage probabilities of $C_1^*$ and $C_2^*$; panels for $(p, n) = (5, 5)$, $(10, 5)$, $(5, 10)$ and $(10, 10)$

So we next consider the following confidence sets,

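$$C_1^* : \; Q_1 = \frac{(\delta_{JS}^{+} - \theta)' M_1^{-1} (\delta_{JS}^{+} - \theta)}{p} \le \frac{(S/n) \cdot c}{|M_1|^{1/p}},$$

$$C_2^* : \; Q_2 = \frac{(\delta_{JS}^{+} - \theta)' M_2^{-1} (\delta_{JS}^{+} - \theta)}{p} \le \frac{(S/n) \cdot c}{|M_2|^{1/p}}.$$

Then we have $\mathrm{Vol}(C_1^*) = \mathrm{Vol}(C_2^*) = \mathrm{Vol}(C_0) = \mathrm{Vol}(C_3)$. Denote $c_i^* = (S/n) \cdot c / |M_i|^{1/p}$.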
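The equal-volume property can be checked numerically. The sketch below assumes, purely for illustration, a diagonal positive definite matrix in place of $M_i$, so that the determinant is the product of the diagonal entries; the helper names and the numerical values are hypothetical.

```python
import math


def unit_ball_volume(p: int) -> float:
    """Volume of the unit ball in p dimensions."""
    return math.pi ** (p / 2) / math.gamma(p / 2 + 1)


def ellipsoid_volume(m_diag, bound: float) -> float:
    """Volume of {u : u' M^{-1} u <= bound} for M = diag(m_diag), entries > 0."""
    p = len(m_diag)
    det_m = math.prod(m_diag)
    return unit_ball_volume(p) * bound ** (p / 2) * math.sqrt(det_m)


def rescaled_bound(m_diag, s: float, n: int, c: float) -> float:
    """p * c_star, where c_star = (S/n) * c / |M|^(1/p)."""
    p = len(m_diag)
    det_m = math.prod(m_diag)
    return p * (s / n) * c / det_m ** (1 / p)


# Hypothetical inputs: a diagonal stand-in for M_i, S, n and the cutoff c.
m_diag = [0.5, 1.2, 2.0, 0.8, 1.5]
s, n, c = 4.0, 5, 5.05
p = len(m_diag)

vol_star = ellipsoid_volume(m_diag, rescaled_bound(m_diag, s, n, c))
# The conventional ball {u : u'u / (S/n) <= p c} is the case M = (S/n) I.
vol_ball = ellipsoid_volume([s / n] * p, p * c)
```

The factor $|M_i|^{1/2}$ in the ellipsoid volume cancels against $|M_i|^{-1/2}$ coming from the rescaled cutoff, which is why `vol_star` and `vol_ball` agree up to rounding.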
Figure 7 shows the behavior of the coverage probabilities of $C_0$, $C_1^*$, $C_2^*$ and
$C_3$. The coverage probabilities of $C_1^*$ and $C_2^*$ behave almost identically, so the
two dashed lines that represent them overlap and look like a single line. We can see from
Figure 7 that the coverage probabilities of $C_1^*$ and $C_2^*$ are larger than that of
$C_0$ and larger than, or at least at the same level as, that of $C_3$. Especially when
$\lambda$ is small, the coverage probabilities of $C_1^*$ and $C_2^*$ are considerably
larger than those of $C_0$ and $C_3$. Since the volumes of $C_1^*$ and $C_2^*$ are equal to
those of $C_0$ and $C_3$, we can say that $C_1^*$ and $C_2^*$ are useful in practice.
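A coverage experiment of this kind can be sketched in a few lines of Python. The sketch below is not the authors' code and covers only $C_0$ and $C_3$, since $C_1$, $C_2$ and their rescaled versions need the MSE-matrix estimators defined earlier. It assumes $\sigma^2 = 1$, $\theta = (\sqrt{\lambda}, 0, \ldots, 0)$ so that $\lambda = \|\theta\|^2/\sigma^2$, the shrinkage constant $(p-2)/(n+2)$ for the positive-part Stein estimator, and a cutoff `c` supplied by the caller.

```python
import random


def positive_part_stein(x, s, n):
    """Positive-part James-Stein estimate; the constant (p-2)/(n+2) is an assumption."""
    p = len(x)
    norm2 = sum(v * v for v in x)
    factor = max(0.0, 1.0 - (p - 2) / (n + 2) * s / norm2)
    return [factor * v for v in x]


def coverage(p, n, lam, c, reps, seed=0):
    """Monte Carlo coverage of C_0 (centered at X) and C_3 (centered at the
    positive-part Stein estimate), with sigma^2 = 1 and ||theta||^2 = lam."""
    rng = random.Random(seed)
    theta = [lam ** 0.5] + [0.0] * (p - 1)
    hits0 = hits3 = 0
    for _ in range(reps):
        x = [t + rng.gauss(0.0, 1.0) for t in theta]
        s = rng.gammavariate(n / 2, 2.0)  # chi-square with n degrees of freedom
        bound = p * c * s / n             # common squared radius of C_0 and C_3
        d = positive_part_stein(x, s, n)
        hits0 += sum((a - t) ** 2 for a, t in zip(x, theta)) <= bound
        hits3 += sum((a - t) ** 2 for a, t in zip(d, theta)) <= bound
    return hits0 / reps, hits3 / reps
```

For example, `coverage(p=5, n=5, lam=1.0, c=5.05, reps=20000)` uses the tabulated value $F_{0.95}(5, 5) \approx 5.05$ as the cutoff. Both sets share the same radius in every replication, so any difference in coverage comes from the change of center alone.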



6 Concluding remarks 

In this article we considered the estimation of the MSE and the MSE matrix of shrinkage
estimators from a decision-theoretic viewpoint. We generalized the arguments of Carter
et al. [4], Wan et al. [17] and Kubokawa and Srivastava [13] and provided some nonnegative
and positive estimators improving on the UMVUE. We also proposed some confidence
sets based on the positive definite estimators of the MSE matrix and showed through
Monte Carlo studies that the proposed confidence sets have the same volume as the
conventional one and attain higher coverage probabilities. These results suggest that the
proposed estimators are useful for evaluating the precision of shrinkage estimators.

Carter et al. [4] and Wan et al. [17] derived asymptotic coverage probabilities of their
proposed confidence regions. By using their argument, we may be able to provide better
confidence regions.

The argument in this article may also be applicable to the estimation of the MSE and the
MSE matrix in the small-area problem discussed in Prasad and Rao [14]. Since small-area
estimation is an illustrative example of the application of shrinkage estimators (e.g. Fay
and Herriot [7]), it would be interesting to extend the results here to that problem.
We leave these problems to future research.



Appendix 

A The derivation of (9) and (10) 

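We first derive (9). Following Efron and Morris (1976),

$$\begin{aligned}
R(\delta_\phi) &= \sigma^2 E\left[ p - \left\{ 2(p-2)\frac{\phi(W)}{W} - (n+2)\frac{\phi^2(W)}{W} + 4\phi'(W) + 4\phi(W)\phi'(W) \right\} \right] \\
&= \sigma^2 E\left[ p - \left\{ 2(p-2)\frac{\phi(W)}{W} + 4\phi'(W) \right\} \right] + E\left[ S\,\frac{\phi^2(W)}{W} \right].
\end{aligned}$$

The UMVU estimator of $\sigma^2 E[p]$ is $pS/n$. With respect to the term

$$\sigma^2 E\left[ 2(p-2)\frac{\phi(W)}{W} + 4\phi'(W) \right],$$

we use the $\chi^2$ identity with an absolutely continuous function $g(W)$ by Efron and
Morris (1976),

$$E[g(W)S] = \sigma^2 E[n g(W) - 2g'(W)W].$$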
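The $\chi^2$ identity can be checked by simulation. The following Python sketch estimates both sides for the illustrative choice $g(W) = 1/(1+W)$, assuming $W = \|X\|^2/S$ with $X \sim N_p(\theta, \sigma^2 I_p)$ and $S \sim \sigma^2\chi^2_n$ independent; the function name and the test function $g$ are not from the paper.

```python
import random


def chi_square_identity_check(n, sigma2, theta, reps, seed=0):
    """Monte Carlo estimates of E[g(W) S] and sigma^2 E[n g(W) - 2 W g'(W)]
    for g(W) = 1 / (1 + W), where W = ||X||^2 / S."""
    rng = random.Random(seed)
    sd = sigma2 ** 0.5
    lhs = rhs = 0.0
    for _ in range(reps):
        x2 = sum((t + rng.gauss(0.0, sd)) ** 2 for t in theta)  # ||X||^2
        s = sigma2 * rng.gammavariate(n / 2, 2.0)               # sigma^2 * chi^2_n
        w = x2 / s
        g = 1.0 / (1.0 + w)
        dg = -1.0 / (1.0 + w) ** 2                              # g'(W)
        lhs += g * s
        rhs += sigma2 * (n * g - 2.0 * w * dg)
    return lhs / reps, rhs / reps
```

The two returned averages should agree up to Monte Carlo error for any $\theta$, since the identity only uses the distribution of $S$ with $X$ held fixed.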
Let $g(W)$ be a solution of the following first-order differential equation,

$$n g(W) - 2g'(W)W = 2h(W),$$

where

$$h(W) = (p-2)\frac{\phi(W)}{W} + 2\phi'(W).$$

Then Wan et al. [17] showed that the UMVUE of $R(\delta_\phi)$ can be written as in (5). The
general solution for $g(W)$ is written as

$$g(W) = W^{n/2}\left\{ \int_W^\infty t^{-n/2-1} h(t)\,dt + C_0 \right\}, \qquad (30)$$

where $C_0$ is a constant. We note that $g(W)$ is required to be absolutely continuous.
When $h(W)$ is continuous, $g(W)$ is also continuous. From the assumption $\phi(W) < \infty$
and $\phi'(W) < \infty$, $E[g(W)]$ exists if and only if $C_0 = 0$. However, in the case of
$\delta_{JS}^{+}$, $h(W)$ is not continuous and $g(W)$ with $C_0 = 0$ is also not continuous.
We can easily show that $g(W)$ is absolutely continuous and $E[g(W)] < \infty$ if and only
if $C_0$ satisfies

$$C_0 = \begin{cases} -2\left(\dfrac{p}{n} - \dfrac{p-2}{n+2}\right)\left(\dfrac{p-2}{n+2}\right)^{-n/2} & \text{if } W < \dfrac{p-2}{n+2}, \\[2ex] 0 & \text{otherwise.} \end{cases}$$
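As a check on the general solution (30) in the simplest case, take the James-Stein choice of a constant $\phi(W) = (p-2)/(n+2)$, which is an assumption made here for illustration. Then $h(W) = (p-2)^2/((n+2)W)$, and (30) with $C_0 = 0$ gives $g(W) = 2(p-2)^2/((n+2)^2 W)$. The Python sketch below compares this closed form with a numerical evaluation of (30) and checks the differential equation by finite differences; all function names are hypothetical.

```python
def h_js(w, p, n):
    """h(W) = (p-2) phi(W)/W + 2 phi'(W) for constant phi = (p-2)/(n+2)."""
    return (p - 2) ** 2 / ((n + 2) * w)


def g_closed_form(w, p, n):
    """Closed form of (30) with C_0 = 0 in the James-Stein case."""
    return 2.0 * (p - 2) ** 2 / ((n + 2) ** 2 * w)


def g_from_general_solution(w, p, n, c0=0.0, steps=2000):
    """W^{n/2} * (int_W^inf t^{-n/2-1} h(t) dt + c0).

    The substitution u = 1/t turns the integral into
    int_0^{1/W} u^{n/2-1} h(1/u) du, evaluated with Simpson's rule.
    """
    upper = 1.0 / w
    step = upper / steps

    def integrand(u):
        return u ** (n / 2 - 1) * h_js(1.0 / u, p, n) if u > 0 else 0.0

    total = integrand(0.0) + integrand(upper)
    for k in range(1, steps):
        total += (4 if k % 2 else 2) * integrand(k * step)
    return w ** (n / 2) * (total * step / 3.0 + c0)


def ode_residual(w, p, n, eps=1e-5):
    """n g(W) - 2 W g'(W) - 2 h(W), with g' from a central difference."""
    dg = (g_closed_form(w + eps, p, n) - g_closed_form(w - eps, p, n)) / (2 * eps)
    return n * g_closed_form(w, p, n) - 2.0 * w * dg - 2.0 * h_js(w, p, n)
```

In this case $g'(W) = -g(W)/W$, so the left-hand side of the differential equation reduces to $(n+2)g(W)$, which equals $2h(W)$.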

By following Wan et al.[17] and the above argument, the derivation of (10) is similar and 
omitted. □ 



B Proof of Theorem 2 

The proof is similar to the proof of Theorem 1. Let $\psi^*(W)$ be the function which
minimizes $E[(\hat R^*(\psi; \delta_\phi) - R^*(\delta_\phi))^2 \mid W]$. Then



n 


E[S/a 2 | 


W] R*(6^) 


1 


V 


E[{S/a 2 ) 2 


| W] a 2 


a(W) 


n 


1 + w 


a P ,n.(^) 





p n + p + 2 a(W) 
< n 1 + WpsJJU) a P ,n(^) 
~ p n + p + 2 a(W) 

By using an argument similar to that in the proof of Theorem 1, we can show that
$\hat R^*(\psi_1; \delta_\phi)$ dominates $\hat R_0^*(\delta_\phi)$. □

C Proof of $\hat R^*(\psi_1; \delta_{JS}) > 0$



Since $\alpha_{p,n}(\delta_{JS}) < p$, it suffices to show that $W_{p,n}(\delta_{JS}) < (p+2)/n$.
In the case of $\delta_{JS}$, $a(W) = n(p-2)^2/(p(n+2)^2 W)$ from (6) and
$\alpha_{p,n}(\delta_{JS}) = n(p-2)/(n+2)$ (e.g. Kubokawa and Srivastava [13]). Hence (23) is
equivalent to

$$\zeta(W) := W(1 + W) = \frac{n+p+2}{n} \cdot \frac{p-2}{n}.$$

Since $\zeta(W)$ is increasing for $W > 0$ and

$$\zeta\left(\frac{p+2}{n}\right) = \frac{p+2}{n} \cdot \frac{n+p+2}{n} > \frac{n+p+2}{n} \cdot \frac{p-2}{n},$$

we have $W_{p,n}(\delta_{JS}) < (p+2)/n$. □



D Proof of Theorem 3 

For an estimator $\hat R^*(\delta_\phi)$, denote $D(\hat R^*(\delta_\phi)) = E[L(\hat R^*(\delta_\phi); R^*(\delta_\phi))]$.
It suffices to show that when $\psi(\cdot)$ satisfies the condition of the theorem,
$D(\hat R_0^*(\delta_\phi)) - D(\hat R^*(\psi; \delta_\phi)) \ge 0$. By using the same argument
as in the proof of Theorem 3 in Kubokawa and Srivastava [13], we have



d f f P S Y „2/tta /2/Uxm n /V<S 



D(i2S(^))-D(ir(^;^)) 

- | a 2 (iy)^ 2 (W) - 2 



> E 
= 2p 



eft I V n 



n 



a{W)ip{tW)R*{8^) \dt 




n 



a(W)a p , n (8^) \Wtf{W)dt 



P 



n 



va 2 (x/v)ip(w) — a{x/v)a p>n {8^) 



E' 



2p 



> — I E v 

n 



> inf I ^ • E l: 

w n 



x i/)'(w)f p (x; A) f n (v)dxdvdw 
9i(x/v)f p (x; X)dx 

P Jq W a 2 (x/v)f p (x; \)dx 
n £ w a(x/v)f p (x] \)dx 

P 



x 



■ vij){w) - ap,n{fy) 



a(x/v)f(x; X)dz ■ < - ■ va(w)ip(w) - a Pjn (fy) 



n 



P 



a(x/v)f p (x; X)dz ■< — ■ va(w)ip(w) - ap, n (8<j,) 



n 



<p'(w)dw 
cj)'{w)dw 
<f>'{w) 



(31) 



where $E^v[\cdot]$ denotes the expectation with respect to $v \sim \chi^2_n$. The first
inequality follows from the fact that $\alpha_{p,n}(\delta_\phi) \ge R^*(\delta_\phi)/\sigma^2$.
The second equality follows from the transformation $(x, v, w) = (WS, S, tW)$ with
$dt\,dW\,dS = x^{-1}\,dx\,dv\,dw$. The second inequality follows from the nonincreasingness of
$a(\cdot)$. Since $a(\cdot)$ is nonincreasing, $\int_0^{vw} a(x/v) f_p(x; \lambda)\,dx$ is
nondecreasing in $v$. With respect to $q(v) = (p/n)\,v\,a(w)\psi(w) - \alpha_{p,n}(\delta_\phi)$,
there exists $v_0$ such that $q(v) < 0$ for $0 < v < v_0$ and $q(v) \ge 0$ for $v \ge v_0$.
Then we have from Kubokawa [12]

$$\begin{aligned}
& E^v\left[ \int_0^{vw} a(x/v) f_p(x; \lambda)\,dx \cdot \left\{ \frac{p}{n}\, v\,a(w)\psi(w) - \alpha_{p,n}(\delta_\phi) \right\} \right] \\
&\quad \ge \int_0^{v_0 w} a(x/v_0) f_p(x; \lambda)\,dx \cdot E^v\left[ \frac{p}{n}\, v\,a(w)\psi(w) - \alpha_{p,n}(\delta_\phi) \right].
\end{aligned}$$



If $\psi(\cdot)$ satisfies the condition of the theorem,

$$E^v\left[ \frac{p}{n}\, v\,a(w)\psi(w) - \alpha_{p,n}(\delta_\phi) \right] = p\,a(w)\psi(w) - \alpha_{p,n}(\delta_\phi) \ge 0.$$

This completes the proof.



□ 



E Proof of Theorem 5 

Define $M_I(\delta_\phi) = \sigma^{-2} M(\delta_\phi)$ and $M_I^*(\delta_\phi) = I_p - \sigma^{-2} M(\delta_\phi)$.
Before we give a proof, we present a lemma required to prove the theorem.

Lemma 1. $\beta_{p,n}^{(1)}$ satisfies



By following Section 2.2 in Kubokawa and Srivastava [13], the proof of this lemma is
easy and omitted here.



Proof of Theorem 5. Let $\xi^*(\cdot)$ and $\eta^*(\cdot)$ be the functions $\xi(\cdot)$ and $\eta(\cdot)$ which minimize
$E[\mathrm{tr}(\hat M(\xi, \eta; \delta_\phi) - M(\delta_\phi))^2 \mid W]$. Similar to (16), we have



C{W) 



1 (\ 



gi (W) \n E[(S/a 2 ) 2 \W] 
1 fl E[S/a 2 \W] 



• tr < E 



-( 

a 2 \ lp 



XX' \ Mj(^) 
||X|| 2 J ' p-1 



w 



gi (W) \n E[(S/a 2 ) 2 \W] 
1 



E[(S/a 2 ) 2 \W] 



■E 



p — 1 



w 



T]*{W) 



g x {W) \n 



w 



E[(S/a 2 ) 2 \W] 



• tr < E 



q XX' 

2 'va~ 2 ' M/W 1 w 



a 
We note that $\xi^*(\cdot)$ does not depend on $\eta(\cdot)$ and that $\eta^*(\cdot)$ does not depend on $\xi(\cdot)$. From
(17), Lemma 1, the assumption that $\beta_{p,n}^{(1)}(\delta_\phi) \ge 0$ and the facts that

$$I_p - \frac{XX'}{\|X\|^2} = \Gamma'(I_p - E_{11})\Gamma \ge 0, \qquad \frac{XX'}{\|X\|^2} \ge 0, \qquad M_I(\delta_\phi) \ge 0, \qquad (32)$$

we have

1 n 1 + ,n <m< 1 



<7i(W) \ ra n + p + 2/ ng 1 (W) 
V*(W)<^—(- + g 3 (W)). 



9i(W) \n 

By using the same argument as in the proof of Theorem 1, we can complete the proof. □ 



F Proof of Theorem 6 

We present a lemma required to prove the theorem.

Lemma 2. $\beta_{p,n}^{(2)}$ satisfies



TO**) > tri 



XI 



(p - l^M) > tr fl p - ^\ ■ M*{8 4 



By following Kubokawa and Srivastava [13], the proof of this lemma is also easy and
omitted here.



Proof of Theorem 6. The proof is similar to that of Theorem 5. Let $\xi^*(\cdot)$ and $\eta^*(\cdot)$ be
the functions $\xi(\cdot)$ and $\eta(\cdot)$ which minimize
$E[\mathrm{tr}(\hat M^*(\xi, \eta; \delta_\phi) - M^*(\delta_\phi))^2 \mid W]$. Then we have



C{w) 



i 



i 



9l (W) E[(S/o*)*\W] 



tr<E 



-( 



XX' \ M* I {8 4 
W )' p-1 



W 



rf{W) 



9s(W) + 



9l (W) ^ ' E[(S/a*y\W] 
From (17), (32) and Lemma 2, we have 



tr ^ E 



- 2 -^f-MKS,)\W 



a 



W 



n + p + 2 ' gx{W) 



B {2) 



rf{W) < 



9i(W) 



9s(W) 



n + p + 2 



0, 



(2) 
p.n 



Hence by using the same argument as in the proof of Theorem 1, we can complete the 
proof. □ 
G Proof of Theorem 7



For an estimator $\hat M^*(\delta_\phi)$, denote $D(\hat M^*(\delta_\phi)) = E[L(\hat M^*(\delta_\phi); M^*(\delta_\phi))]$.
Denote $\hat M^*(\xi, \eta; \delta_\phi)$ with $\eta(W) = 1$ by $\hat M^*(\xi; \delta_\phi)$. We first show that when
$\xi(\cdot)$ satisfies the conditions of the theorem, $\hat M^*(\xi; \delta_\phi)$ dominates
$\hat M_0^*(\delta_\phi)$. In the same way as (31), we have



D(M*(S^))-D(M*(C;S^)) 
= E 



00 j 

T A(p-l)S 2 gl(W)e(tW) 



1 dt 



- 2S gi (W)Z(tW)tr[T'(I p - E n )TM*(S^}}dt 



> (p- l)a 4 E 
= 2(p- l)a 4 E 



{(S/a 2 ) 2 ^)^) - (5/a a )^ 1 (W0/S£ , iW}e , (W)* 



E r 



gi(x/v)f p (x; X)dx 



x 



Jo 9Kx/v)f p (x; \)dx 



> / E 



/„ gi(x/v)f p (x)X)dx 
9i{x/v)f p (x; \)dz ■ {v 2 g x {w)i(w) - v^ 2) n (S^)} 



£'(w)dw 



£'(w)dw. 



(33) 



The first inequality follows from Lemma 2 and the second inequality follows from the
nonincreasingness of $g_1(\cdot)$. Similar to the argument in the proof of Theorem 2, if $\xi(W)$
satisfies the conditions of the theorem, we have

$$E^v\left[ v^2 g_1(w)\xi(w) - v\,\beta_{p,n}^{(2)}(\delta_\phi) \right] = n(n+2)\,g_1(w)\xi(w) - n\,\beta_{p,n}^{(2)}(\delta_\phi) \ge 0,$$

and hence $\hat M^*(\xi; \delta_\phi)$ dominates $\hat M_0^*(\delta_\phi)$.

Next we show that when $\eta(\cdot)$ satisfies the conditions of the theorem, $\hat M^*(\xi, \eta; \delta_\phi)$
dominates $\hat M^*(\xi; \delta_\phi)$. In the same way as the above argument, we have

D(M*(£; d<j,)) — D(M*(£, 77; 6$)) 
d 



E 



dt {S 2 gl(W) V 2 (tW) - 2S 2 9l (W) L(W) + ) V (tW) 



2Sg 1 (W)r](tW)tT[T'E u TM*(S^]}dt 
> a 4 E



(S/a 2 ) 2 g 2 (W)r) 2 (tW) - 2(S / a 2 ) 2 9l (W) ( g 3 (W) + ) V (tW) 



W 



-2(S/a 2 fg 1 (W)r ) (tW)^l(S <t> )\dt 



2a 4 E 



(S/*ygi(W)r](tW) - (S/ay gi (W) g 3 (W) + 



<P 2 {W) 

w 



poo 




/ E^ 




/o 


Jo I 



- (S/a^W)^ 2 ^) \Wrj\tW)dt 
v 2 gl{z/v)r){w) 



^(2; A)dz 



9i(z/v) 



rj'(w)dw 



>2a 4 / E 



9l{z/v)f p {z] \)dz 



X < V 2 T](w) — 



v 2 93{w) + rfn(fy 



r]'(w)dw 



(34) 



The first inequality follows from Lemma 2 and the second inequality follows from the
nonincreasingness of $g_1(W)$ and the nondecreasingness of $g_3(W)/g_1(W)$. By using the
same argument as in the proof of Theorem 3, if $\eta(w)$ satisfies



rj{w) > min 1, 



/3p 2 l(8,f 



gi(w) n + 2 gi(w) 
the right hand side of (34) is nonnegative. 



□ 